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Abstract — 

Cooperative decision making is a vision of future network man- 
agement and control. Distributed connection preemption is an im- 
portant example where nodes can make intelligent decisions on allo- 
cating resources and controlling traffic flows for multi-class service 
networks. A challenge is that nodal decisions are spatially depen- 
dent as traffic flows trespass multiple nodes in a network. Hence the 
performance-complexity trade-off becomes important, i.e., how accu- 
rate decisions are versus how much information is exchanged among 
nodes. Connection preemption is known to be NP-complete. Cen- 
tralized preemption is optimal but computationally intractable. De- 
centralized preemption is computationally efficient but may result in 
a poor performance. This work investigates distributed preemption 
where nodes decide whether and which flows to preempt using only 
local information exchange with neighbors. 

In this work, we first model a large number of distributed 
preemption-decisions using a probabilistic graphical model. We then 
define the near-optimality of distributed preemption as its approx- 
imation to the optimal centralized preemption within a given error 
bound. We show that a sufficient condition for distributed preemp- 
tion to be optimal is that local decisions should constitute a Markov 
Random Field. The decision variables, however, do not possess an 
exact spatial Markov dependence in reality due to the flows passing 
through multiple links. Hence we study traffic patterns of flows, and 
derive sufficient conditions on flows for the distributed preemption 
to be near-optimal. We develop, based on the probabilistic graphi- 
cal models, a near-optimal distributed algorithm. The algorithm is 
used by each node to make collectively near-optimal preemption de- 
cisions. We study trade-offs between near-optimal performance and 
complexity that corresponds to the amount of information-exchange 
of the distributed algorithm. The algorithm is validated by both anal- 
ysis and simulation. 

Index Terms — Distributed preemption decision. Complexity, prob- 
abilistic graphical models. Probabilistic inference. 



I. Introduction 

A vision of future network management is to involve nodes to 
make intelligent decisions on allocating resources and controlling 
traffic flows. This includes admitting new flows by preempting 
less important existing flows, which is well studied in the policy 
based admission control (i.e., admission is based on the priority 
of flows) im ifTSl . Specifically, preemption is defined at a prior- 
itized multi-class network, where a new call needs to be set up 
with a high priority between a source (S) and a destination (D) [8J 
lfT3l ifTSi II25I ll27l . When the capacity is insufficient at all feasi- 
ble routes between the source-destination (S-D) pair, some existing 
flows of the lower priorities need to be forced to reduce their band- 
width, move to the lowest service class (e.g., best-effort-service), 
or simply preempted to accommodate the new call. Preemption 
decisions is to decide which lower priority flows to remove to free 
the reserved bandwidth for the new call at a chosen route JS] ll27ll . 



total preempted bandwidth can be minimal under such constraints 
as bandwidth demand of a new call and available free bandwidth 
at each linkfl 

The benefit of preemption has been described in the prior works. 
For example, preemption allows a new high-priority connection to 
access heavily crowded core networks, e.g., multi-protocol label 
switched (MPLS) networks [30 1. Connection preemption also im- 
proves resource utilization by allowing low-priority flows to ac- 
cess unused bandwidths (TT\ (3U\. Preemption sees potential ap- 
plications in emerging networks. For example, in 802. lie Wire- 
less LAN, delay sensitive IP packets in expedited forwarding (EF) 
class can be served earlier than the best-effort packets through pre- 
emption |26|. Multi-level preemption and precedence (MLPP) is 
proposed to classify calls by their importance, which can be used 
for military as well as commercial networks fl]. 

There are two significant challenges for preemption which are 
performance and complexity. Performance corresponds to whether 
right flows are preempted to result in the minimal bandwidth to ac- 
commodate a new flow. Complexity corresponds to the amount of 
information needed for preemption decision. Preemption is known 
to be NP-complete |8 1. The complexity results from a large num- 
ber of active flows supported by a core network for which preemp- 
tion decisions need to be made. For example, for a IGbps link, if 
the bandwidth of each flow is in the order of Kbps, there would be 
thousands of flows supported per link. In addition, a flow gener- 
ally passes through multiple nodes, making preemption decisions 
among nodes dependent and thus difficult to be done with local 
information. Thus preemption is network-centric, and may require 
a huge amount of information to perform in a large network. 

For centralized preemption decisions, a centralized node main- 
tains the routed-path information of active flows, their priorities 
and bandwidth occupancies at the entire route. The centralized 
node then decides which active flows to preempt upon the request 
of a new call. Therefore, centralized preemption can always be op- 
timal, resulting in minimal preempted bandwidth. But the amount 
of management information needed can be overwhelming at the 
centralized node. For example, let Ft be the total number of dis- 
tinct flows per priority class at the route of a new call. Each flow 
has two states, preempted or not preempted. The total number of 
possible states is 0(2^' ) for making a centralized decision. When 
Ft is in the order of hundreds or thousands lfT4l . centralized pre- 
emption becomes computationally intractable. Decentralized pre- 
emption is then adopted for reducing the amount of management 
information li27l . 

^The preempted flows are usually rerouted to other paths. Hence preemption 
and rerouting can be considered jointly with somewhat different objectives 1 30 1 . 
This work, however, focuses on preemption on a given path without considering 
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Decentralized preemption is done at each node individually, and 
thus requires a node to maintain its local information, i.e., active 
flows at the adjacent links, their priorities and bandwidth occu- 
pancy. Such information is available locally at nodes. A node then 
decides, independently from the other nodes, which connections 
to preempt. This, however, may cause conflicting local decisions 
on the same flows that pass multiple links on the route, resulting in 
more preempted bandwidth than necessary. In other words, decen- 
tralized preemption decision neglects the spatial dependence for 
the flows across multiple links, and may perform poorly. But the 
amount of management information are greatly reduced compared 
with centralized preemption. 

For example, let F be the maximum number of active flows 
per link. The total number of states is 2^ at each link. Since 
2^ <C 2^', compared with centralized preemption, decentralized 
schemes have a much smaller search space for preemption 
decisions. Therefore, most algorithms in the literature focus on 
decentralized preemption (see 1251 ||27| and references there in). 

This work studies distributed decisions, that take into account 
spatial dependence among neighboring links through local 
information exchange. In fact, distributed preemption can be 
considered as a generalization of centralized and decentralized 
preemption. Centralized preemption corresponds to one extreme 
case of distributed preemption that an entire route is the neighbor- 
hood for information exchange; whereas, decentralized decisions 
correspond to another extreme case where the neighborhood 
size is zero. Therefore, the communication complexity can be 
characterized in terms of neighborhood size. There is a trade-off 
between the optimality and the complexity. 

In general, it has been shown to be a difficult problem to develop 
a distributed algorithm whose performance is predictable and 
within a tolerable degradation (i.e., given error bound) from that of 
the optimal scheme [31] ■ Hence, the open issues are: (a) When can 
distributed decisions collectively result in a near-optimal global 
preemption? (b) How to model a large number of dependent 
decision variables and to obtain near-optimal local decisions using 
distributed algorithms? We apply machine learning to study these 
issues. 

Machine learning perspective: A machine learning view of 
distributed preemption is that individual nodes "learn to make de- 
cisions" collectively and iteratively. Ideally, if each node has com- 
plete information on all active flows at the route of a new flow, the 
node will be able to make correct decisions on which flows to pre- 
empt. However, at any given time, a node has only partial informa- 
tion on the active flows on the route and its neighbors' decisions on 
the flows to preempt. But a node can adapt, i.e., learn to make de- 
cisions based on those of its neighbors'. As neighbors learn from 
neighbors' neighbors, a node would indirectly learn what farther 
nodes decide only with a delay. Eventually, all nodes would make 
local decisions, collectively resulting in a near-optimal preemption 
at the entire route. 

How would machine learning benefit distributed preemption? 
The problem of collective learning and decision-making has been 
a keen interest in machine learning and adaptive control ID ifTOl . 
but has just begun to see applications in networking. In particu- 
lar, Q proposes using Markov Random Fields as a general model 
of decision-making in Ad hoc wireless networks. The model is 



1221 obtain probabilistic graphical models for ad hoc wireless and 
wireline networks starting from network properties l22lll23l , and 
the resulting probabilistic models turn out to be multi-layer. This 
work focuses on distributed decisions on network flows. We view 
machine learning as a framework in which a large number of de- 
cision variables can be treated jointly. Spatial dependence among 
these variables poses a key challenge to preemption, is an origin 
of high communication complexity, and has not been dealt with 
sufficiently in prior works. Machine learning provides feasible ap- 
proaches for this problem as summarized below. 

(a) Global model of distributed preemption decisions: We first 
develop a probabilistic model that represents explicitly the spa- 
tial dependence of distributed preemption decisions over a pre- 
determined preempting route of a new flow. The randomness re- 
sults from randomly arriving/departing active flows and their loca- 
tions. The preemption decisions made on flows at each node are 
also random due to incomplete and inaccurate local information 
for distributed preemption. We first obtain a cost function for pre- 
emption as a "Hamiltonian" (or "system potential energy") 1271 . A 
Hamiltonian combines local preemption decisions and constraints 
into a single quantity. The constraints include link capacity, un- 
used bandwidths and bandwidth-demand of a new flow at each 
link. The Hamiltonian is then used to obtain a spatial probabilistic 
model as a Gibbs distribution |9|. 

(b) Markov Random Field (MRF) and sufficient conditions: 
Spatial dependence can be characterized through a probabilistic 
dependency graph of graphical models Il9J[ 17|| 19| in machine 
learning. A probabilistic dependency graph provides a simple yet 
explicit representation of the spatial dependence among random 
variables. We show that if the dependence of decision variables is 
spatially Markovian, a globally optimal preemption decision can 
be obtained collectively by iterative local decisions through infor- 
mation exchange only with neighboring nodes. Such a probabilis- 
tic model is known as a Markov Random Field ||9| . 

In general, distributed decisions may not be spatially Markov, 
since the spatial dependence is caused by flows across multiple 
links. Hence we identify traffic patterns of active flows that result 
in approximately spatial Markov dependence. We then define the 
near-optimality of distributed decisions as the difference between 
the centralized and distributed decisions, measured in the Hamil- 
tonian, and obtain sufficient conditions for the difference to reside 
within an error bound. 

(c) Distributed Decision Algorithm: A near-optimal distributed 
algorithm is derived based on the Markov Random Field. The al- 
gorithms can be implemented through either message passing |,19ll 
or Gibbs sampling |9|. 

(d) Trade-offs: A challenging issue is the performance- 
complexity trade-off, i.e., "when" and "how" distributed preemp- 
tion can achieve a near-optimal performance with a moderate com- 
plexity. Here the performance measures the optimality of dis- 
tributed preemption decision relative to that of the centralized op- 
timal decision. The communication complexity of distributed pre- 
emption can be characterized by the amount of information used 
in distributed decision making. Distributed decisions reduce com- 
plexity using information exchange only with neighbors, but may 
deviate from the optimal performance. Hence we study perfor- 
mance and complexity trade-off through both analysis and simula- 
tion. 
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vides a problem formulation on connection preemption. Section 
Uni develops a probabilistic spatial model of distributed preemp- 
tion, utilizing the graphical models in machine learning and inter- 
preting the derived model in terms of optimality and complexity. 
Section HVl proposes a distributed preemption algorithm based on 
the derived model, using probabilistic inference. Section [V] ana- 
lyzes the performance of distributed preemption. Section |VI] val- 
idates the performance of distributed preemption through simula- 
tion. Section [VTIl provides a further literature review and discus- 
sions. Section lVIIII concludes the paper. 

II. Distributed Preemption 

A. Example 



S 
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(b) Preemption decisions at each hop over the route of a new connection 
Fig. 1 

Example of preemption 

Figure [T] (a) shows an example network, and Figure [T](b) illus- 
trates distributed preemption on a given route. Assume that a new 
call request is made on the route between the SD pair, and for sim- 
plicity, all flows have the same bandwidth. To accommodate the 
new flow, the centralized preemption would preempt two existing 
flows, i.e., flow 1 and flow 2 that are marked with X. Such a pre- 
emption decision is obviously optimal. Now consider distributed 
preemption. In reality, distributed preemption decisions are made 
at nodes. For notational convenience, we regard the decision is 
made at the link which connects the left node. For example, link 
2 is the link beginning with node 2. The local information avail- 
able at node 2 includes the priorities and the bandwidths of flows 
1, 3 and 4 that pass through this node. When the bandwidth is the 
same for all flows, node 2 may decide to preempt flow 4 without 
knowing that nodes 1 and 3 both decide to preempt flow 1 . Such 
a decision would result in more flows to be preempted than nec- 
essary compared with the centralized decisions. In contrast, node 
2 would choose to preempt flow 1 if node 2 also has the informa- 
tion on the decisions made by the neighbors (nodes 1 and 3). This 
example shows the following facts. 

(a) Local decisions are spatially dependent. The spatial depen- 
dence originates from flows that trespass multiple nodes, and link 



(b) The spatial dependence can be taken into consideration by 
exchanging local decisions among neighbors. How many nodes 
should exchange local decisions depend on extents of flows. 

(c) The information exchanged would result in consistent deci- 
sions across a network, and thus improve the optimality of local 
preemption done at nodes. 

These facts motivate cooperative distributed preemption formu- 
lated below. 

B. Problem Formulation 

Assumptions: We assume that a preempting route Rp is pre- 
determined for a new connection [8| [27|, and composed of node 
1, • • •, and node L + 1. We assume that the traffic flows on the 
route belong to multiple priority classes 1, • • •, imax, and a new 
connection belongs to class i„eto and demands bandwidth Cnew ■ 

Variables: Let Sp be a set of all active flows on route Rp, where 
Sf = {/\ • • • , /l^^'} with \Sf\ being the cai'dinality of set Sp- 
f'^ and B'^ denote flow k on the preempting route and its band- 
width. We consider the decisions at links. For notational conve- 
nience, without causing any confusion between nodes and links, 
we denote link i as the link between node i and node z + 1 for 
simplicity for 1 < i < L, where L is the number of links on the 
considering route. Let fi be the set of all active flows at link i for 
l<i< L. 

Let denote the preemption decision on flow k at link i for 
1 < i < L. c?f=l if link i decides to preempt the flow; c?f=0, 
otherwise. Let di denote the set of local preemption decisions on 
all active flows at link i. Let d denote all local decisions on the 
route, then d = {(i'^,for 1 < k < I-S'fI}, where d*^ denotes the 
preemption decision on flow k over the entire preempting route, 
d'^ = 1 — HiiLill ^ df). Hence, = 1, i.e., flow k is preempted 
from the given route, if at least one link decides to remove the 
flow, and d''' = if all links decide to keep the flo\\0. Hence, 
d*^ = 1 is a global decision of the entire path, and a local decision 

(1 < i < L and k £ fi) impacts the global decision. 

Problem statement: Assume that the following information is 
maintained at node i {1 < i < L + 1): (a) complete local in- 
formation on the active flows at link i which includes flow ID k, 
class priority of flow fc, bandwidth of flow k B'', for k E fi and 
1 < k < \Sf\', and (b) neighbor information that includes deci- 
sions from the neighboring links within iVrf hops for Nd > 1. 

Given ak (ak > 0) as the priority weighjjof flow k, and B^ > 
as the amount of unused bandwidth at link i, for 1 < i < L, 
preemption is to obtain a set of decisions d*^'s that 

minimize ^ UkB'^d'^ (1) 

l<k<\SF\ 

subject to 

d'l e {0, 1}, for 1 < i < L, 

where v4i = S'^d*^ + 5° is the total available bandwidth at link 
i for the new flow. The constraint requires that the sum of the un- 
used bandwidth before preemption and the preempted bandwidth 

^Note that preemption in a general context can be considered as removing a high 
piiority flow to the best-effort class rather than completely terminating the service 
for the flow. 

•^For example, if flow k\ and k2 belong to class 1 and 2 respectively, ai^, < 
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at any link i should be sufficiently large for accommodating the 
new flow. 

Goal: The goal of this work is to approximate this global op- 
timization problem through distributed preemption decisions. In 
particular, the first step is to derive a distributed algorithm that 
obtains a set of local preemption decisions made at links through 
information exchange with neighbors. The second step is to ob- 
tain near-optimality conditions under which distributed decisions 
approximate the globally optimal preemption decisions. 

Note that the objective function in (HJ is the cost correspond- 
ing to the total preempted bandwidth. Such an objective function 
is used by most of the existing works ||271 |[8| ||29l . Since d'' = 
1 — J|^^(l — d^),a flow is preempted if at least one node decides 
to preempt the flow, i.e., — for at least one i and a given k. 
Hence, the global optimization in ^ requires making consistent 
decisions on the same flow at all links over the preempting route, 
i.e., (if 's are all equal for a given k. 

Distributed preemption is to determine {df } fori <i<L and 
fc e /j that minimize ([T]i using the local and the neighbor informa- 
tion. Hence the objective of distributed preemption is still global 
over the entire path. But the management-information exchange 
is local for making preemption decisions. Hence a key chal- 
lenge is how to model and coordinate a large number of spatially- 
dependent local decisions to achieve the global objective of pre- 
emption in a fully distributed fashion. 

Table 1 summarizes our notations. 

TABLE I 

Notations 



4 


Local preemption decision made at link i for flow k 


d^ 


Global preemption decision of flow k of the path 




Bandwidth of flow k 


L 


The total number of links at the path 



HL Probabilistic Spatial Model of Preemption 
Decisions 

We begin by developing a global model to represent the spatial 
dependence of a large number of distributed preemption decisions. 
We then derive a local model as an approximation. The global and 
local models are developed through probabilistic graphical models 
in machine learning. 

A. Global Model 

A global model should include accurate spatial dependence re- 
sulting from flows, objectives and constraints on distributed pre- 
emption decisions. 

1 ) Deterministic Flows 

We first consider an example with a given set of active flows. 
Let H{d) be the cost for setting up a new connection of a high 
priority. We express H{d) by expanding the cost from ([T]) and 
using the Lagrangian multiplier for the constraints, 

H{d) = Va,B'=(V4- (2) 



L 

+/3^(7(c„e» -Ai), 
1=1 

where /3 > is a Lagrangian multiplier, 1 < ii i2 1^ L, 
1 < k < \Sf\, U{x) is an indicator function: U{x) = if a; < 0; 
1, otherwise. The /3-term corresponds to the capacity constraint in 
O- 

Hence minimizing ^ corresponds to deterministic optimization 
(refer ([T]i), and conceptually, centralized preemption can always 
find an optimal set of flows to remove. 

Distributed preemption allows each link i to update its decisions 
iteratively and asynchronously based on local information (Ai) and 
neighbors' decisions (d*^, j G Nd, where Nd is a neighborhood 
of link i). Each link/node can only access limited and initially 
inaccurate information from near-neighbors and missing informa- 
tion from far-neighbors. But through neighbor's neighbors, such 
information would eventually propagate to all nodes, resulting in 
globally consistent decisions. A difficulty is that deterministic dis- 
tributed decisions may get stuck at a local optimum ||9l. 

2) Random Flows 

What and how many flows are active at which Hnks are related 
to user behaviors and thus random. Hence active flows and their 
aggregation at individual links should be regarded as random vari- 
ables. Preemption decisions made on active flows should be con- 
sidered as random also. A set of decisions thus form a sample 
space = {d}, a subset of which consists of events due to dis- 
tributed decisions. A given set of decisions on a given set of flows 
is then a sample realization of an event. One such sample is shown 
in the example of Figure[Tl where d={d\, df, d^, d"^, d\, d\, d\, d\ 
}= {1, 0, 1, 0, 1, 0, 1, 0}. This relates random and deterministic 
flows and decisions. 

To obtain an optimal set of preempted flows, stochastic rather 
than deterministic optimization should be used, and this requires 
obtaining a probability distribution of d. 

Such a probability distribution can be obtained through graphi- 
cal models defined on neighborhood systems jO). A neighborhood 
system can be characterized by Hamiltonian which is also called 
system potential energy |,21,| . The energy of a decision variable 
corresponds to a per-variable preemption cost, akB^d^_^, in the 
first terms of (|2|i. Interactions between decision variables of any 
two different links result in V , / j, ahB^ d^ as the 
second terms of (|2|i. The remaining terms correspond to higher- 
order interactions. 

In this context, H{d) corresponds to a Hamiltonian of d, and 
results in a Gibbs distribution ||9l II2TII . 

P{d) = ^-^exp , (3) 

where T is a parameter (the temperature |'9l), and Za is a nor- 
malization constant. The Gibbs distribution is a probability dis- 
tribution of decision variables, and thus provides a mathematical 
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This Gibbs distribution also provides the implementation method- 
ology of near-optimal distributed and iterative preemption deci- 
sion, which is shown in Section |IV] The minimum of the Hamil- 
tonian corresponds to the optimal preemption decisions that max- 
imize the probability. 

3) Probabilistic Graphical Models 

The spatial dependence among a large number of decision 
variables can be represented explicitly by probabilistic graphical 
models. A graphical models relates a probability distribution of 
random variables with a corresponding dependency graph ||9l ifTTl 
|fT9l . A node in the graph represents a random variable and a link 
between two nodes characterizes their dependence. In particular, 
a set of random variables v forms Gibbs Random Field (GRF) if 
it obeys a Gibbs distribution f2T\. A Gibbs distribution satisfies 
the positivity condition, meaning that all decisions have a positive 
probability. One other important property is the spatial Markov 
dependence defined by the neighborhood system and shown by 
Hammersley-Clifford theorem. 

Hammersley-Clifford Theorem H21\I : Let S be the set of nodes, 
S = {1, ■ ■ ■ , N}. Let V be a set of random variables, v = 
{vi, ■ ■ ■ ,vn}. 

V is said to be a Markov Random Field if (i) P(v) > for V v 
in sample space; (ii) P(vi\vj for j £ S\{i}) = P(vi\vj for j G 
Ni), where Ni is a neighborhood of node ifor i £ S. 

The random field v is also said to be a Gibbs Random 
Field if its probabilistic distribution can be written in the form 
P(v)=Y[^^(j Vc{v), where c is a clique, C is the set of all feasible 
cliques, and Vc{v) is a general positive function called a clique 
potential function. 

There is an equivalence between a Gibbs Random Field and a 
Markov Random Field if and only if the Gibbs distribution P(v) 
possesses the spatial Markov property. 

Markov Random Fields correspond to an interesting type of 
probabilistic graphical models where a random variable is condi- 
tionally independent of the other nodes given its neighbors. The 
conditional independence is spatially nested, i.e., a node depends 
on its far neighbors through neighbors' neighbors. Such nested 
dependence can be observed explicitly through local connections 
among nodes in a dependency graph. The corresponding Gibbs 
distribution is thus factorizable over clique potentials |9|. 

An important implication to distributed preemption is that if dis- 
tributed decisions result in an MRF, local decisions using neighbor 
information are collectively optimal. But do preemption decisions 
d form a Markov Random Field in the first place? We plot the 
dependency graph for the Gibbs distribution with the Hamiltonian 
in (|2]i. In particular, a factor graph |fT9]| in Figure|2]is used to draw 
the dependency graph of the decision variables for the set of flows 
shown in Figure [T] 

A factor graph is a bipartite graph that expresses the spatial de- 
pendence between the variable nodes and the function nodes (19). 
In Figure IH circles represent the decisions on the active flows at 
links. Squares denote the link-functions, corresponding to the lo- 
cal potentials at individual links. Specifically, gi{d) is a local po- 
tential that encompasses the flows passing through link i. A con- 
nection between a circle and a square indicates a functional rela- 



link 1 and extends to link 2 and 3. Multiple variables relating to 
the same local potential are dependent, e.g., d\ and are depen- 
dent through gsid). This is because flow 1 and flow 5 pass the 
same link 3 and are thus dependent due to the limited capacity 
constraint. Meanwhile, different local potentials can be dependent 
if they share some flow-variables, e.g., gi{d) and (73(d) are both 
connected to c?J and dj. This is because flow 1 passes link 1 and 
link 3. This shows the global dependence, resulting from long 
flows which extend to far neighboring links/nodes. 
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Fig. 2 

Spatial dependence of decision variables 



Such statistical dependence can be represented quantitatively by 

local potentials X^ii^i = H{d) as in (IH, for example, gi{d) 
in Figure|2]is a local potential that encompasses the flows passing 
through link 1, i.e., 

gi{d) = a\B^ (di — d\d2 — d\d\ + di 0(2^3) + azB^di 
+/9t/(c„e„ - ^1), 

where ai=a^ because both flows 1 and 3 belong to class 1. 

Hence the graphical and the mathematical representations show 
that in general a decision random variable at a flow (circle on the 
graph) can have connections with both near and far neighboring 
local potentials (squares on the graph). This implies that in gen- 
eral, the decision variables are not spatially Markov, and the Gibbs 
distribution is thus not factorizable. 

B. Local Model 

If the long-range spatial dependence can be removed from the 
probabilistic dependency graph, the spatial dependence can be ap- 
proximated through a spatial Markov model, i.e., a Markov Ran- 
dom Field. Such a Markov Random Field considers only depen- 
dence of decision variables with their neighbors, resulting in a 
truncated Hamiltonian as follows, 

H\d) = E"^^'(E<"E E '^^'^^ 

k ii ii |i2— *ll< — -^d 

L 

+P^U{Cne.w - Ai), 
i=l 

where 1 < ii ^ 12 ^ L, 1 < k < \Sp\, and denotes the 
neighborhood size of a node. 

The corresponding Gibbs distribution is 

P'(d) = Z-exp(^-^^^. (4) 
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(d) is an approximated likelihood function. 



T 



(5) 



where exp ^— is a local likelihood function for the con- 
nections at link i, and can be further decomposed into all clique 
potentials associated with connections at link i: 



exp 



-9^id) 
T 



exp 



T 



(6) 



where Ci is the set of all cliques of link i, and ^c{d) is a potential 
function of clique c. 

For example, if the neighborhood size = 1 for all links, the 
corresponding factor graph has only nearest neighbor connections 
as shown in Figure [3] where the dash lines denote the neglected 
dependency links. 




Maximizing local likelihood functions can be implemented as 
local learning algorithms at individual nodes. The learning algo- 
rithms perform probabilistic inference using either approximated 
sum product algorithm lfT9l or stochastic relaxation l9] . The sum 
product algorithm can be applied to the factor graph in Figure |3] 
This algorithm produces an exact solution for a graph that has no 
loops. However, the factor graph ||T9l of preemption problem is 

usually loopy, resulting in approximated (non-optimal) decisions. 
Stochastic relaxation can be applied for each link to make local 

preemption decisions. Let df'^^^''^ be a set of decisions on active 
flows at link i, excluding the decision on flow k. Here Sp \ {k} 
denotes a set operation, which excludes k from Sp- Now we add 
time variable t to the decisions 0, and let (t + 1) be an updated 
decision on flow f'^ at the {t + l)th iteration and at link i. Then, 

4'(i + l) = l, (8) 

with the probability 

p' (dUt + i) = i|<\^'=^t),div,w) = 

exp(-V>i(d,^(t + l) = l)/r(t + l)) 

E,,^(t+i)e{-i,i} '^'^P {-MdHt + ^))int + 1)) ' 

where 

V.(df(t + l) = l) = afeS*- ^ + + 



Fig. 3 

Localized spatial dependence of d with Factor Graph 



IV. Distributed Preemption Algorithms 

We now assume that a local model is obtained as a good ap- 
proximation of the global model. The spatial Markov local model 
then can be used to derive a distributed algorithm where nodes can 
make local decisions on connection preemptions through informa- 
tion exchange with neighbors. 

A. Distributed Algorithm 

The distributed algorithm obtains a set of local decisions that 
maximizes the approximated likelihood function, which is equiva- 
lent to minimizing the cost function. 



d 



argmax P^{d) 
d 

argmini/'(d). 
d 



(7) 



Since (d) is factorizable, maximizing the global likelihood 
function reduces to maximizing the local likelihood function at 
cliques, i.e., P\di\dNi) for I < i < L, where d^^ is the set 
of decision variables of neighboring links. As these local likeli- 
hoods are functions of the decision variables of neighboring links, 
the decisions can be updated locally. In addition, the local maxi- 
mizations result in coupled equations due to the nested Markov de- 
pendence, which shows that information exchange is needed only 



This means that a random decision at time epoch t + 1 is made 
based on local information df'^'^^'^^{t) and neighbor information 
cZjVi (t) at the previous time epoch t. A cooling schedule is applied 
to the temperature T{t) = ro/log(l + t) with ro=3.0. This re- 
sults in an almost-sure convergence of the algorithm to the global 
minimum Hamiltonian (i.e., optimal decisions) [9]. That is, with 
the iterative and distributed updates, the global minimum of the 
approximated Hamiltonian (d) can be reached asymptotically 
with probability one. 

B. Example 

We now revisit Figure [T] to show an example of the distributed 
algorithm. Consider links 1 and 2, and assume that the neighbor- 
hood size Nd=\- That is, a node only exchanges information with 
its nearest neighbors. 

At initial stage, no flows are preempted, i.e. {d\ (0)=0, d\ (0)=0, 
d^(0)=0, d^(0)=0}. When t = 1, the decision variables are up- 
dated, 

d}(l) - ai-g max P(d}(l) = d|d?(0), djvi (0)), 
tie{o,i} 

where d^iiO) = {^2(0)' '^2(0)}- The updated decision d\{l) is 
sent to the neighboring links. This process is applied similarly to 
the other decision variables. At the second time epoch (t = 2), 



d-{2) 



arg max P(di(2) 
de{o,i} 



d|4(l),djv.(l)), 



where = { (/}(!), 4(1), 4(1)}. 

The process is repeated until an equilibrium state (i.e., of no 
more changes) is reached. 
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C. Information Exchange 

The distributed preemption decisions require information ex- 
change with neighbors. The clique structure of the Markov Ran- 
dom Field determines the range of information exchange, which 
is the neighborhood size Nd- The type of the information ex- 
changed is binary, i.e., di^^{t), as in the conditional probability 
in (|8]l. The amount of information used at a decision making char- 
acterizes the communication/computation complexity. The infor- 
mation exchange is per-flow based but moderate when limited to 
neighbors. 

V. Near-Optimality and Complexity 

In this section, we conduct analytical studies to identify suf- 
ficient conditions for the near-optimality of the distributed pre- 
emption, the communication/computation complexity, and the 
optimality-complexity trade-off. 

A. Short-Range Dependent Decision Variables 

The near-optimality is in regard to the question when distributed 
preemption decisions are nearly optimal. To answer this question, 
we need to consider how well a Markov Random Field approxi- 
mates the global model. This should be done by studying the traffic 
patterns of active flows since the flows across multiple links over 
the preempting route and the limited link-capacity constraints at 
links are the origins of spatial dependence of distributed decisions. 

1 ) Bounded-Length Flows 

Traffic patterns of active flows result in spatial dependence 
among distributed decision variables. Consider simplified traffic 
patterns where the hop-count of each active flow is bounded by 
h for h > 1. Then the set of distributed preemption decision 
variables are strictly Markov as shown below. 

Lemma 1 : Assume that the hop-count of each active flow is 
bounded by h (h > 1). Let Nj^ be a set of neighborhood of link 
i and include all links within h hops from (i,j). Let d^f^={dm, 

forym G Nj^} denote a set of decisions in the neighborhood, and 
d\di be all decision variables except di. Then, P(di\d\di) = 
Pidild^J. 

The proof is provided in Appendix U Lemma 1 shows that the 
set of decision variables on active flows of a limited span forms 
a Markov Random Field (MRF), where h corresponds to an upper 
bound of the neighborhood size of the MRF. This is intuitive as the 
active flows of a bounded length would only introduce short-range 
spatial dependence. 

2 ) Shortest-Path Flows 

In reality, however, the hop-count of active flows is a variable 
and cannot be assumed to be bounded with a meaningfully small 
value (e.g., 1 or 2 hops). Thus, we study the spatial dependence 
of decision variables for shortest-path flows that constitute more 
realistic traffic patterns. In particular, we consider shortest-path 
flows with the following assumptions for analytical convenience: 

(1) A network is planar and homogeneous where each node (ex- 



(2) A source-destination pair is chosen randomly from all pairs 
in the network. 

(3) A preempting route is a shortest-path between the source and 
the destination of a new connection. 

(4) Active flows are assumed to take shortest routes from 
randomly-chosen source-destination pairs whose paths may par- 
tially coincide with the route of the new connection. 

(5) The route length of a new flow is assumed to be L hops for 
analytical simplicity, where L > 1. 

We now define a measure of spatial dependence of two links on 
the preempting route (e.g., link (i — and + 1) for 2 < i, 

and i < j < L). 

Definition 1. Link-Dependency Probability Piji Pij denotes 
the probability that a flow uses both link (i — and (j,j + 1) 
that are separated by \ j — i\ hops on the preempting route. 

The link-dependency probability Pij then characterizes the 
spatial dependence of any two flows at these two links. Pij is 
difficult to obtain exactly, and thus bounded as follows. 

Lemma 2: Let P^ be a lower bound of Pij, i.e., Pij > Pij. For 
shortest-path flows under assumptions (1) through (5), Pjj = 

y L >^do-l> 

The proof is provided in Appendix HI] This lemma suggests 
that the length of a shortest-path flow over the preempting route 
follows at least a geometric probability, where -j^^ is the lower 
bound of the probability for such a flow to continue at the next hop. 

Lemma 3: Let P^j be an upper bound of Pij, i.e., Pij < P^j. 
Consider a network topology of a regular lattice with nodal degree 
4. For shortest-path flows under assumptions (1) through (5), 

p}'. = ) ^ L ' 2(2lJ-l-i) ' 

where C{a,b) = j^^-^^^ is a combinatorial coefficient, and 
\j -t\>2. For \j -t\»l,P:;^ (^^4^) , /,V -I - 

The proofs can be obtained by counting the number of shortest 
paths between node i and j, and is given in AppendixHlll 

Figure |4] depicts both the upper and lower bound as well as an 
empirical probability Pij . The probability Py is estimated on a 
regular lattice network with 250 nodes, where active flows are 
routed onto the shortest paths between randomly chosen source- 
destination pairs. 10 runs are conducted and the results are aver- 
aged to obtain the empirical probability. As shown in the figure, 
Pij decays rapidly close to the exponential decreasing rate of the 
lower bound P/, . 

Lemma 2, 3 and the empirical result suggest that on the average, 
shortest-path flows share only few hops with the preempting route. 
Thus, as we shall soon see, Markov Random Fields is a good ap- 
proximation to a set of decision variables d for a mesh topology. 

B. Sufficient Conditions for Near-Optimality 

We now define the near-optimaUty of distributed preemption. 
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Fig. 4 

Upper and lower bounds of the probability that a flow visits 
both links (i — and (j ,j + 1) on the preempting route 



Definition 2. Near-optimality of distributed decisions: Consider 
a given route of a S-D pair of a new connection. Consider random 
flows on the route that obey Pij, i.e., a flow would continue at \j—i\ 
hops from the source with probability Pij. Let d* and d be two 
sets of preemption decisions that minimize the global Hamiltonian 
HQ and its approximation H^Q, respectively. The optimality of 
distributed decisions d is measured by the expected value of the 
difference, i.e., £'(A), where A = \H{d*) ~ H{d)\. The expectation 
is over random flows and randomized decisions. 

Given a desired performance e > 0, if E{A) < e, d is 
near-optimal. 

We now derive sufficient conditions for the near-optimality. 
This suffices to investigate whether and when the long-range 
dependence of active flows can be neglected in the global Hamil- 
tonian. For feasibility of analysis, we consider the traffic patterns 
with a geometric probability drawn from Lemma 2 and 3. Such 
traffic patterns exhibit a certain practical relevance as shown in 
the above section, and is also analytically tractable. 



near-optimality needs to specify when the effects of aggregated 
long flows are negligible in the truncated Hamiltonian. For 
feasibility of analysis, we consider a simplified scenario that the 
bandwidths of active flows are bounded. 

Theorem 1: Let Bq > be a constant bandwidth. Consider 
a straight route of a new flow. For given esfO < es < Ij, 
assume that bandwidth of flow k satisfles \ ^ r,^" | < cs for 



all k. Then E{A) < 2c„ 



■L-[{1+ p^) 



So 



1], 



where Nd is the neighborhood size for information exchange 
in distributed preemption decisions. When p^'^L = o(l), the 
upper bound of 2c„e„ • ■ L ■ [{I + p^'^)^''^" - 1] = 



2c 



The proof is provided in Appendix |IV] Theorem 1 provides 
the following observations when active flows follow a geometric 
distribution. 

(a) For a given pc and Cnew , the larger the neighborhood size Nd 
in the Markov Random Field, the smaller the upper bound, and the 
better the performance may be for distributed preemption. In fact, 
the error bound decreases exponentially with respect to Nd- 

(b) The upper bound increases linearly with respect to the band- 



width demand of a new flow c„ 



characterizes the 



Boll- 
maximum number of active flows to accommodate the new flow 

with Cnew at a link. Thus, the larger the Cnew, the more existing 
flows may need to be preempted, the higher the probability for the 
distributed algorithm to make inconsistent decisions at links. That 
is, the performance of distributed preemption may degrade when 
the bandwidth demand of a new flow increases. 

(c) The upper bound also increases with respect to the route 
length L, since a longer route consists of more links and thus a 
higher probability for distributed decisions made at links to be in- 
consistent. 

It should be noted that the above studies of the optimality as- 
sume that the stochastic relaxation is capable of obtaining a global 
minimum of the global and local models. This holds true as the 
convergence of the algorithm occurs almost surely |i9J . 

C. Complexity 

A key advantage of distributed preemption is the reduced 
complexity, i.e., the information exchange is limited to only 
neighbors. 



Definition 3. Flow-continuity probability pc is the probability 
that an active flow continues onto the next link on the preempting 
route. 

The notion of flow-continuity probability pc has been used in 
two other contexts to describe the extent of an optical flow Q ll20l . 
As shown in Lemma 3, pc characterizes the range of dependence 
of active flows, and corresponds to a special case of Pij . In fact, 
if a flow continues with probability pc at each link independent of 
the other links, the length of an active flow would obey a geometric 
probability ||20ll . For example, pc = -d~i for the lower bound of 
the continuity probability of the shortest-path flows over a planar 
lattice topology with node degree d^. 

Even when a large number of flows are short-range dependent. 



Definition 4. Communication Complexity ( CC): Let Nd denote 
the neighborhood size for exchanging binary information (bits) in 
distributed preemption. Let fmax denote the maximum number 
of active flows at a link. Let iter denote the total number of 
iterations needed for the distributed algorithm to converge. 
Communication complexity ( CC) of a node is deflned as the 
total amount of information exchanged for a link to make a de- 
cision using the distributed algorithm, i.e., CC = 0{NdfmaxHer)- 

Note that 0{Nditer) is the bits of information exchanged for 
making a preemption decision on one flow. There are at most fmax 
flows at a node. This results in CC = 0{NdfmaxHer)- Hence, if 
iter can be bounded by a moderate value, as shall be shown in 
Section VI, CC would be 0{Ndfrnax) which grows linearly with 
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We now compare qualitatively CC with decentralized preemp- 
tions. Min-Conn and Min-BW [27| are the representatives of the 
existing decentralized algorithms that minimize the number of pre- 
empted flows and the amount of preempted bandwidth at each hop, 
respectively, without information exchange. The complexity of 
Min-Conn and Min-BW are 0{ff^^^) and 0(/toq2;2-^""^ ), respec- 
tively. 

Hence by bounding iter with a moderate value, we can obtain 
a globally near-optimal decision that is obtained with a smaller 
complexity than that of decentralized algorithms. We shall show 
this in the next section. 

We now compare qualitatively CC with centralized preemp- 
tions. The communication/computation complexity for a central- 
ized scheme increases linearly with the number of hops on the 
preempting route L. Hence, if << L, the complexity for dis- 
tributed preemption is much smaller than that of the centralized 
preemption due to local information exchange among neighbors. 

D. Optimality and Complexity Trade-off 

How large should be for a given L, traffic pattern (pc) and 
other parameters? Theorem 1 shows that reducing N^, i.e., the 
communication/computation complexity, results in a simpler local 
model but a larger error bound. Therefore, a trade-off between the 
optimality and complexity needs to be explored. 



log 



Corollary 1: For a given performance e, ;/ Nd > 



log -L 



then E{A) < e. For L large, L :$> Nd 



and > small, the condition reduces to Nd > ^ { !°^f I ^ 



The proof of Corollary 1 can be obtained directly from Theorem 
1 by letting E[A] < e and simple algebraic manipulations, and 
thus omitted. 

The corollary implies that a sufficient condition for distributed 
preemption decisions to be near optimal is for Nd to be of an order 
log L, when the flow continuity probability decays exponentially 
with the number of hops. Meanwhile, the larger and the smaller 
e are, the larger Nd is. This shows clearly a trade-off between the 
performance and complexity. 

VI. Simulations 

We now study further how the performance of distributed deci- 
sions varies with respect to neighborhood size and traffic patterns 
through simulation. We also compare the distributed preemption 
with the methods used in the prior work. 

A. Performance Metrics and Simulation Setting 

We use the average preempted bandwidth per link at a chosen 
path, X)/c B^d!', as the performance metric in our simulation. 
The metric is used to quantify the effectiveness of distributed pre- 
emption in bandwidth savings. 

Our simulation generates a network topology, a certain traffic 
pattern, and a chosen route. To be specific, our simulation studies 
consider a network with two service-classes. The capacity C of 



are uniformly distributed between 1.25 and 2.5 Mbps, and 2.5 and 
37.5 Mbps, respectively. 

We use both mesh and power-law topologies in the simulations. 
The power law topology has 80 nodes generated through BRITE 
f5lf6l. The mesh topology is generated as a planar lattice topology 
with 100 nodes. The nodal degree of a lattice topology is do = 4 
except the edge nodes. This results in flow-continuity probability 

Pc=\- 

The flows of each service class are evenly distributed over the 
network. The arrival and the departure flows of each class follow 
a Poisson distribution with arrival rate (A^) and departure rate (/li), 
fori = 1,2. 

The source and destination of a new connection are chosen at 
random in a network. The resulting route has about 10 hops on 
average for the S-D pair of each active flow. 

We conduct over 10 experiments with random initial conditions 
and get the averaged values as results. For each run, active flows 
are routed over the shortest-path between S-D pairs and accepted 
if bandwidths are available, so the network is heavily populated 
with active flows. The distributed algorithm is used to obtain a set 
of local decisions. The preemption decision for the flows on the 
path is then obtained according to Section |IV] The performance 
metrics are averaged over all runs. 

B. Performance and Neighborhood Size 

A new connection setup assumes bandwidth demand c„e«;=20 
Mbps and class inew = 2. Distributed preemption decisions are 
made by ^ changing neighborhood size 1 < Nd < 2. Decentral- 
ized decisions are implemented using Min-Conn algorithm ll27ll 
for comparisons. There, each node makes decentralized preemp- 
tion decisions independently without any cooperation with neigh- 
bors. The performance in the average preempted bandwidth per 
link is shown in Table Ull and HiH for distributed preemption of dif- 
ferent neighborhood sizes and decentralized preemption. For both 
topologies, the preemption costs are reduced sharply with the co- 
operation with neighbors. 

For planar lattice topologies, the nodal degree do=4 results in the 
flow-continuity probability pc=l/3. The link-dependency proba- 
bility Pij thus decreases in 0(p!/ ''). As shown in Table HH even 
with the smallest neighborhood Nd — 1, the preemption cost of 
distributed decision can be reduced by 50% compared to the de- 
centralized preemption algorithm. 

For power-law topologies, nodes have different degrees, and the 
path-length of a connection is around 2 or 3 hops on the average. 
Thus, the link-dependency probability Pij also decays sharply. 
The effectiveness of the distributed preemption is similar to that 
of planar lattice topologies. 

TABLE II 

Preemption Costs on a Planar Lattice Topology of do=4 



Decentralized Preemption 


Nd=l 


Nd=2 


16.7 


7.6 


6.3 



C. Neighborhood Size and Traffic Patterns 

We now study how the performance of the distributed preemp- 



10 



TABLE III 

Preemption Costs on a Power-Law Topology 



Decentralized Preemption 


Nd=l 


Nd=2 


17.2 


8.8 


7.5 



each experiment, active flows are generated randomly for each 
flow-continuity probability pc and neighborhood size Nd- This 
is repeated for a wide range of pc and Nd values. The preempt- 
ing route has L = 10 hops, and a new connection has bandwidth 
demand c„eto=20 Mbps. 

Figure |5] (a) shows that the preempted bandwidth decreases 
sharply by including the information only from the nearest neigh- 
bors. This is especially significant for a small pc (e.g., pc=0.3), 
which corresponds to short flows. Nd=Q corresponds to decentral- 
ized decisions where there is no information exchange with neigh- 
bors. Hence the figure shows that the cooperation with the nearest 
neighbors (i.e., Nd=l) can improve the performance by 53%. 

The cooperation with farther neighbors (e.g., Nd=4-) results in 
another 3.3% bandwidth saving for pc — 0.3. But the improve- 
ment is not significant given the increase of communication com- 
plexity. Hence, for short flows, the information exchange between 
the nearest neighbors seems to be sufficient to achieve the near- 
optimality. 

As Pc increases, the dependence among decision variables on 
different links increases, and the performance gains are more pro- 
nounced with a larger neighborhood size. Figure |5](b) shows that 
the preempted bandwidth decreases linearly with an increase of pc 
for a given Nd- This is because that the dependence of two links 
increases along with pc- Thus, for a given Nd, the amount of pre- 
empted bandwidth decreases with an increase of pc- 

Figure |5] (b) also shows that the preempted bandwidth of the 
distributed preemption is smaller than that of Min-Conn 12711 algo- 
rithm. The complexity of Min-BW algorithm is 0{fmax ■ 2^™"= ), 
which is computationally intractable for fmax large. Thus, 
Min-Conn algorithm (whose complexity is 0{f^^^)) is used for 
comparison. 




(a) Distributed preemption: Varying TV^j for different pc 




(b) Distributed preemption, compared with decentralized algoritiim in 1271 . 

Fig. 5 

Average preempted bandwidth, with c„c^=20 Mbps, link capacity 
C=100 Mbps, and L=10 hops on the preempting route 



D. Path Length 

Now we consider the impact of path length L for fixed pc=QA. 
Other parameters used are c„etu=20 Mbps, and C = 100 Mbps. For 
a given Nd, as L increases, the preemption cost decreases. 

Figure |6] (a) shows that for all Nd values, the corresponding 
preempted bandwidth decreases as L increases. However, the de- 
crease of preempted bandwidth is lower bounded for L > 30 hops, 
such as Figure|6](b). 

E. Bandwidth Demand 

Now we consider the impact of bandwidth demand Cnew of a 
new connection together with the neighborhood size. Other pa- 
rameters are fixed and chosen as pc=QA, and C = 100 Mbps. 

Figure [T] shows that for all Nd values, the corresponding pre- 
empted bandwidth increases with Cnew linearly. Moreover, the 
gain of the distributed preemption is significant compared with de- 



VII. Related Work and Discussions 

Connection preemption: Garay and Gopal (HI show that cen- 
tralized connection preemption is NP-complete, and thus develop 
heuristic algorithms. The algorithms consider all connections on a 
preempting route but are not optimal in preemption decisions. 

Peyravian and Kshemkalyani f27l propose two practical decen- 
tralized algorithms. The algorithms incorporate multiple factors 
such as the priority of each connection, the bandwidth and the 
number of connections to be preempted. The algorithms are lo- 
cally optimal at each link but not globally optimal at the entire 
route since there is no information exchange among links. Oliveira 
1^251 formulates connection preemption through linear program- 
ming and proposes adaptive heuristic algorithms. The lower- 
priority connections are then preempted to minimize the impact 
to the active bandwidth. The optimality is not considered there. 

Stanisic [ 29,1 develop two randomized preemption algorithms 
whose complexity is linear in the number of lower priority flows. 
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(b) Compaiison of distributed and decentralized preemption: Varying L 
Fig. 6 

Average preempted bandwidth. c„em=20 Mbps, C=100 Mbps, and 

Pc=0.4 



et.al. Il24ll considers soft preemption that utilizes a preemption 
pending flag to mitigate gracefufly the re-route process of pre- 
empted LSP. 

As the existing algorithms provide important empirical results, 
the issues of optimality and complexity have been studied insuf- 
ficiently. Most of these algorithms do not consider information 
exchange among neighbors. 

Distributed Management: Flow preemption can be cast in 
a general context of distributed management. Distributed algo- 
rithms and protocols have been designed using local information 
on configuration management (see ll32l and references therein). 
Self-organizing protocols are proposed for sensor networks and 
developed for p2p self-stabilizing using graph coloring (see ifTsl 
and references therein). The distributed management, however, 
has been traditionally done based on heuristic local rules. Near- 
optimal distributed algorithms remain to be an open problem when 
spatial dependence is non-Markovian. 




c [Mbps] 



Fig. 7 

Comparison of distributed and decentralized preemption 



els have been used to represent the spatial dependence in complex 
systems. Markov Random Fields have been used to model the 
cooperation of mobile agents Q in addition to their wide applica- 
tions in image processing |9^|. Factor graphs are used to represent 
the dependence and as computationally efficient distributed algo- 
rithms |19|. 

One related work is a cross-layer graphical model developed 
for optical networks [22], where a Markov Random Field models 
the spatial dependence of routes at the network layer The other 
related work is our prior work where the probabilistic graphical 
models are used for the distributed management of wireless ad hoc 
networks lfT6l . The work presented in this paper, however, consid- 
ers a different problem of flows in a different setting of wireline 
networks. 

Optimality and Complexity: Optimality in terms of network 
capacity has been investigated for wireless ad hoc and sensor net- 
works |28|. However, near-optimal algorithms have been studied 
little for controlling flows. 

VIII. Conclusions 

In this work, we have studied distributed connection preemp- 
tion in multi-class networks. The work is motivated by the fact 
that connection preemption is known to be NP-complete. Central- 
ized preemption can achieve an optimal performance but has an 
intractable communication complexity. Decentralized preemption 
is computationally feasible but lack of a good performance. This 
work has focused on whether a near-optimal performance can be 
achieved by distributed preemption at a moderate communication 
complexity. We have developed a distributed framework, where 
nodes make local preemption decisions through cooperation with 
neighbors. The framework treats distributed preemption as a ma- 
chine learning problem where a large number of statistically de- 
pendent decisions can be treated jointly. 

Specifically, we have developed a probabilistic spatial model 
of distributed preemption decisions. We have shown that a suf- 
ficient condition for distributed preemption to be near-optimal is 



12 



identified a cause of spatial dependence which is due to flows tres- 
passing muhiple links. We have examined commonly-used traffic 
patterns including short-range dependence flows and shortest-path 
flows. This results in a certain sufficient conditions on the near- 
optimality. In particular, the sufficient conditions quantify joint 
impacts of the flow-continuity probability, the bandwidth demand 
of a new flow, the communication complexity of distributed algo- 
rithms, and route lengths for short-range dependent active flows. 

We have shown that the spatial dependence can be characterized 
by probabilistic graphical models. The graphical models allow 
us to apply distributed algorithms based on stochastic relaxation. 
We have shown through analysis and simulations that for short- 
range dependent flows, information exchange with only the nearest 
neighbors can significantly improve the performance of preemp- 
tion. The use of more neighbors result in further improvements but 
not as pronounced given an increase in the communication com- 
plexity. 

More general traffic patterns require a further study. It re- 
mains open how to quantify the validity/invalidity of distributed 
preemption for long-range dependent flows. For example, the 
long-range spatial dependence cannot be ignored in linear or ring 
topologies, and thus the neighborhood for distributed preemption 
is not locaUzed. Hence, distributed algorithms need to be extended 
to multiple routes and other traffic patterns in a more realistic 
network setting where the impact of topologies may become 
significant. From a computational standpoint, one disadvantage of 
distributed preemption using stochastic relaxation is a slow con- 
vergence. Hence future work also involves a study of computation 
time in terms of delays. 
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Appendix 

I. Proof of Lemma 1 

Consider two links i and m. Let \i — m\ denote the distance of 
these two links. The distance of two links indicates the hop-counts 
of the shortest path between them. 

When \i — m\ < hg, there can be a flow that shares both links. 
Otherwise, i.e., |i — m| > ho, there cannot be any flow that shares 
both links. Therefore, the decisions at link i are conditionally in- 
dependent of the other links beyond ho hops. 

II. Proof of Lemma 2 

As given in assumption (1), there are multiple routes from node 
i to node j on the planar regular topology, one of which is the 
preempting route. Since each node has nodal degree do, the prob- 
ability is ( ^ ) for a connection that passes link {i — 1) on the 
preempting route to trespasses through the next link i. Thus, the 
probability that an active flow trespasses links i — 1, ■ ■ ■, j on the 
preempting route is (g^zrj- . 

Assume that link i — 1 is the first link that an active flow en- 
counters the preempting route. Consider the fc-th link from the 
source node of an active flow. Assume that an active flow first 
meets each link on the preemption route equally likely. Then the 



of an active flow is for 1 < k < L. Thus, the probability that 
an active flow meet on link z — 1 at the preemption route proceeds 



to link j is ( 



), where \j — i\ is the number of hops be- 



tween link i — 1 and link j. Therefore, a lower-bound of is 



L-\j-i\ 



III. Proof of Lemma 3 



Consider two nodes i and j on the preempting route. As each 
connection is assumed to follows the shortest path between a 
source-destination pair, we obtain an upper bound of the proba- 
bihty Pij by counting the total number of shortest paths between 
node i and j with \ j — i\ hops. 

The total number of shortest paths from node i to j with \ j ~ i\ 
hops is always upper bounded by C{\j ~ i\, ■'^y^). To be specific, 
a shortest-path from node i to j is composed of k horizontal and 
|j — i| — fc vertical hops, and the total number of shortest paths 
from node z to j is C (|j — i|, fc), for 1 < k < \j — i\- Evidently, 
C{\j -i\,^)=max{C{\j -i\,k),foTl<k< Thus, 
an upper bound of the number of shortest paths from node i to j 
with \j — i\ hops is C(|j — i\, ^^^). 

Now, consider a set of nodes which are separated from node i 
by \ j — i \ hops over the shortest paths. We count the total number 
of shortest paths from node i to this set of nodes. Starting from 
node i, we can reach one of such nodes by taking r horizontal 
steps and |,j — i| — r vertical steps, for 1 < r < \j — i\. For 
instance, with all positive vertical and horizontal steps (i.e., not 
going backwards), the number of shortest paths with the distance 
of I j — i\ hops from node i is X^r Ji*' ^ili ^ 'I; From binomial 
formula, X^HTi' ~ ^L^) - S'^^*' — 1. However, there are 
four combinations about the same directions of vertical/horizontal 
steps. The nodes that are located on the line of radian 0, tt, and 
^ centered at node i are counted twice. Thus, a lower bound of 
the total number of shortest-paths from node i to the set of nodes 
is 2(2l^-*l - 1) for \j - i| = 2, and 3(21^-^1 - 1) for \j - i\ > 2. 

An upper bound of probability Pij is the ratio between an upper 
bound of total number of shortest paths from node i to j, and a 
lower bound of total number of shortest paths from node i to the set 
of nodes that are \ j — i\ hops away. Moreover, as shown in Lemma 
2, the probability that an active flow on link (i — proceeds 

further on the direction of the new flow by | j — z | hops is ( ) . 
Thus, 



V L ) 3(2lJ-*l-l) ' \J *l ^ ^• 



(10) 



Now consider |j — «| >> 1. Using StirUng formula, n! 



27rexp(-n)n"+°-^ Thus, C (\j - i\, 

1 

2^27v\j-i\ ' 



pu — 1 

U ~ ^ L / 2, /27rli-il ■ 



and 



IV. Proof of Theorem 1 

Consider a set of randomly generated active flows and a ran- 
domly chosen preempting route of L hops. 

Then, A= \H{d*) - H{d)\. To find an up- 

per bound of A, we have \H(d*) — H{d)\ = 
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< \{H{d*)- H^{d*)) + [H{d*)- H\d) 

equality holds because H{d*) < H{d) and H\d) < H'{d*) by 
definition of d* and . 

Assume that for the global and local optimal decisions, d* and 
d', the second-order consistency is achieved as shown in the con- 
straint in That is, — for |« — j| < Nd, and for 
1 < A; < \Sf\- Since df^'s are binary, the second-order consis- 
tency implies all orders up to L — consistency. That is, for a 
given link i. Consider a set 5'^ = ii, • • • , i/ that contains indices of 
any other links within Nd hops of link i. 

Thus, 



Here the in- with the preempting route is — Pc) p, 



2„b"-it + K-i|-2 



d'^ 



"■ii- 



E{\H{d) - 
/2 + /a + • 



H\d)\)+E{\H{d* 



H'{d*)\) 



(11) 



where 



»1 »27^»l.|*l-j2i<-'Vd 



(12) 



E 



»!, - 1 7^»l , I »1 -«L - 1 I <-'V<j 



for|^l-^^| < A^^, for / = 2, • • • , L. 

Hence, within a neighborhood A^^j of any given link, all links 
make consistent decisions. Beyond such a neighborhood, links 
can make different and thus incorrect preemption decisions. To 
bound the error, we let da be a feasible preemption decision with- 
out distinguishing whether it is globally or locally optimal. Con- 
sider an active flow k on link i of a preempting route. For the local 
model H'-{da) of neighborhood size Nd, the continuity of the ac- 
tive flow is neglected beyond the neighborhood Nd, i.e., on the 
links {i + Nd + m) or (i — Nd — m) for m > 1. Hence, the error 
caused by using the local model can be counted by the neglected 
active flows beyond the neighborhood. 

Specifically, from I2, the error caused by a flow k that leaves 
at link i + Nd + m corresponds to the bandwidth B'' of flow 
k, and the probability that the flow leaves at this portion of the 
path is p^'*+'"~^(l — pc), for m > 1. Therefore, for one active 
flow on the preempting route, the total expected error of ignor- 
ing the second-order terms for Vm > 1 is less than B'^ J2m=i 
{L ~ Nd)p^^+"'-\l - Pc) < B'^p^^iL - Nd). 

Thus, for a feasible configuration da, the expected error caused 
by neglecting the second-order terms (i.e., I2) for all flows is upper 
bounded by 2^^^^^p^^ Bo{l + eB)(L - Nd), where B^\rZs) 
denotes the maximum number of flows feasible to be preempted at 
a link. 

Similarly, from the third term (i.e., /s), the expected error 



for 

|j — i| > Nd and |Z — i| > Nd, where (1 — Pc)pc ^ is the 
probability a flow continues to the link j from link i and then 
exists the route. Since there are at most i^^.^'^) such terms for 
each flow in the third term of (|4|l, this error is upper bounded by 

A similar bound can be obtained for the m-th-order term of (|2]|, 
for TO = 2, • • • , L. Thus, let A ~ 2cnewjz^, then 



£(A) < E{\H{d)-H'{d)\)+E{\H{d*)-H'{d*) 
< A{L - Na) 

' " - 1 



+Pc 

< AL((l+pf<')^-^'' - 1) . (13) 
For p^-'L = 0(1), the bound is AL{L ~ Nd)?^" + o{Lp'^'^). 
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